Improving Fluency in a Sesotho Text-to-Speech Hybrid System

نویسندگان

Lehlohonolo Mohasi

Daniel Mashao

چکیده

Most of the present text-to-speech systems produce an acceptable quality speech output. Text-tospeech systems that are based on limited domain techniques produce speech that is close to human speech; however, they lack flexibility in that they cannot be used to synthesize words not in their own vocabulary. One approach of dealing with the flexibility problem is to use hybrid systems which combine limited domain systems and open vocabulary systems. This only solves part of the problem as discontinuities between words generated by different systems become apparent in the produced speech. In this paper, we improve the hybrid system by implementing techniques that can mask the discontinuities so that the output speech is more fluent. The proposed system was evaluated by carrying out subjective listening tests. In the tests, 20 listeners evaluated the quality of the speech output based on the MOS scoring system. The results showed an improvement on fluency with an overall score of 3.7 from 3.05.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Off-line Arabic Handwritten Recognition Using a Novel Hybrid HMM-DNN Model

In order to facilitate the entry of data into the computer and its digitalization, automatic recognition of printed texts and manuscripts is one of the considerable aid to many applications. Research on automatic document recognition started decades ago with the recognition of isolated digits and letters, and today, due to advancements in machine learning methods, efforts are being made to iden...

متن کامل

Integrating Machine Translation and Speech Synthesis Component for English to Dravidian Language Speech to Speech Translation System

This paper provides an interface between the machine translation and speech synthesis system for converting English speech to Tamil text in English to Tamil speech to speech translation system. The speech translation system consists of three modules: automatic speech recognition, machine translation and text to speech synthesis. Many procedures for incorporation of speech recognition and machin...

متن کامل

Improving of Feature Selection in Speech Emotion Recognition Based-on Hybrid Evolutionary Algorithms

One of the important issues in speech emotion recognizing is selecting of appropriate feature sets in order to improve the detection rate and classification accuracy. In last studies researchers tried to select the appropriate features for classification by using the selecting and reducing the space of features methods, such as the Fisher and PCA. In this research, a hybrid evolutionary algorit...

متن کامل

Developing a Corpus to Verify the Performance of a Tone Labelling Algorithm

We report on a study that involved the development of a corpus used to verify the performance of two tone labelling algorithms, with one algorithm being an improvement on the other. These algorithms were developed for speech synthesis purposes with the aim of improving the perceived naturalness as well as the intelligibility of the speech produced by the synthesizer. The corpus used to test the...

متن کامل

Cipher text only attack on speech time scrambling systems using correction of audio spectrogram

Recently permutation multimedia ciphers were broken in a chosen-plaintext scenario. That attack models a very resourceful adversary which may not always be the case. To show insecurity of these ciphers, we present a cipher-text only attack on speech permutation ciphers. We show inherent redundancies of speech can pave the path for a successful cipher-text only attack. To that end, regularities ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2006

Improving Fluency in a Sesotho Text-to-Speech Hybrid System

نویسندگان

چکیده

منابع مشابه

Off-line Arabic Handwritten Recognition Using a Novel Hybrid HMM-DNN Model

Integrating Machine Translation and Speech Synthesis Component for English to Dravidian Language Speech to Speech Translation System

Improving of Feature Selection in Speech Emotion Recognition Based-on Hybrid Evolutionary Algorithms

Developing a Corpus to Verify the Performance of a Tone Labelling Algorithm

Cipher text only attack on speech time scrambling systems using correction of audio spectrogram

عنوان ژورنال:

اشتراک گذاری